Democratic Republic of the Congo
A Multilingual Sentiment Lexicon for Low-Resource Language Translation using Large Languages Models and Explainable AI
Malinga, Melusi, Lupanda, Isaac, Nkongolo, Mike Wa, van Deventer, Phil
South Africa and the Democratic Republic of Congo (DRC) present a complex linguistic landscape with languages such as Zulu, Sepedi, Afrikaans, French, English, and Tshiluba (Ciluba), which creates unique challenges for AI-driven translation and sentiment analysis systems due to a lack of accurately labeled data. This study seeks to address these challenges by developing a multilingual lexicon designed for French and Tshiluba, now expanded to include translations in English, Afrikaans, Sepedi, and Zulu. The lexicon enhances cultural relevance in sentiment classification by integrating language-specific sentiment scores. A comprehensive testing corpus is created to support translation and sentiment analysis tasks, with machine learning models such as Random Forest, Support Vector Machine (SVM), Decision Trees, and Gaussian Naive Bayes (GNB) trained to predict sentiment across low resource languages (LRLs). Among them, the Random Forest model performed particularly well, capturing sentiment polarity and handling language-specific nuances effectively. Furthermore, Bidirectional Encoder Representations from Transformers (BERT), a Large Language Model (LLM), is applied to predict context-based sentiment with high accuracy, achieving 99% accuracy and 98% precision, outperforming other models. The BERT predictions were clarified using Explainable AI (XAI), improving transparency and fostering confidence in sentiment classification. Overall, findings demonstrate that the proposed lexicon and machine learning models significantly enhance translation and sentiment analysis for LRLs in South Africa and the DRC, laying a foundation for future AI models that support underrepresented languages, with applications across education, governance, and business in multilingual contexts.
Prompting Explicit and Implicit Knowledge for Multi-hop Question Answering Based on Human Reading Process
Huang, Guangming, Long, Yunfei, Luo, Cunjin, Shen, Jiaxing, Sun, Xia
Pre-trained language models (PLMs) leverage chains-of-thought (CoT) to simulate human reasoning and inference processes, achieving proficient performance in multi-hop QA. However, a gap persists between PLMs' reasoning abilities and those of humans when tackling complex problems. Psychological studies suggest a vital connection between explicit information in passages and human prior knowledge during reading. Nevertheless, current research has given insufficient attention to linking input passages and PLMs' pre-training-based knowledge from the perspective of human cognition studies. In this study, we introduce a Prompting Explicit and Implicit knowledge (PEI) framework, which uses prompts to connect explicit and implicit knowledge, aligning with human reading process for multi-hop QA. We consider the input passages as explicit knowledge, employing them to elicit implicit knowledge through unified prompt reasoning. Furthermore, our model incorporates type-specific reasoning via prompts, a form of implicit knowledge. Experimental results show that PEI performs comparably to the state-of-the-art on HotpotQA. Ablation studies confirm the efficacy of our model in bridging and integrating explicit and implicit knowledge.
Machine Learning-based NLP for Emotion Classification on a Cholera X Dataset
Recent social media posts on the cholera outbreak in Hammanskraal have highlighted the diverse range of emotions people experienced in response to such an event. The extent of people's opinions varies greatly depending on their level of knowledge and information about the disease. The documented re-search about Cholera lacks investigations into the classification of emotions. This study aims to examine the emotions expressed in social media posts about Chol-era. A dataset of 23,000 posts was extracted and pre-processed. The Python Nat-ural Language Toolkit (NLTK) sentiment analyzer library was applied to deter-mine the emotional significance of each text. Additionally, Machine Learning (ML) models were applied for emotion classification, including Long short-term memory (LSTM), Logistic regression, Decision trees, and the Bidirectional En-coder Representations from Transformers (BERT) model. The results of this study demonstrated that LSTM achieved the highest accuracy of 75%. Emotion classification presents a promising tool for gaining a deeper understanding of the impact of Cholera on society. The findings of this study might contribute to the development of effective interventions in public health strategies.
US 'strongly condemns' violence in DR Congo after alleged drone attack
The United States has condemned growing violence in the Democratic Republic of the Congo (DRC), blaming an armed group it says is backed by neighbouring Rwanda. Fighting has flared in recent days in the eastern part of the DRC between the M23 rebel group and government forces, resulting in dozens of soldiers and civilians being killed or wounded. The fighting has also pushed tens of thousands of civilians to flee towards the eastern city of Goma, which is located between Lake Kivu and the border with Rwanda. "This escalation has increased the risk to millions of people already exposed to human rights abuses including displacement, deprivation, and attacks," US State Department spokesman Matthew Miller said in a statement. "The United States condemns Rwanda's support for the M23 armed group and calls on Rwanda to immediately withdraw all Rwanda Defense Force personnel from the DRC and remove its surface-to-air missile systems, which threaten the lives of civilians, UN and other regional peacekeepers, humanitarian actors, and commercial flights in eastern DRC," Miller added.
DR Congo accuses Rwanda of airport 'drone attack' in restive east
The Democratic Republic of the Congo has accused Rwanda of carrying out a drone attack that damaged a civilian aircraft at the airport in the strategic eastern city of Goma, the capital of North Kivu province. Fighting has flared in recent days around the town of Sake, 20km (12 miles) from Goma, between M23 rebels โ which Kinshasa says are backed by Kigali โ and Congolese government forces. "On the night of Friday to Saturday, at 2-o-clock in the morning local time, there was a drone attack by the Rwandan army," said Lieutenant-Colonel Guillaume Ndjike Kaito, army spokesperson for North Kivu province. "It had obviously come from the Rwandan territory, violating the territorial integrity of the Democratic Republic of the Congo," he added in a video broadcast by the governorate. The drones "targeted aircraft of DRC armed forces".
Machine Intelligence in Africa: a survey
Tapo, Allahsera Auguste, Traore, Ali, Danioko, Sidy, Tembine, Hamidou
In the last 5 years, the availability of large audio datasets in African countries has opened unlimited opportunities to build machine intelligence (MI) technologies that are closer to the people and speak, learn, understand, and do businesses in local languages, including for those who cannot read and write. Unfortunately, these audio datasets are not fully exploited by current MI tools, leaving several Africans out of MI business opportunities. Additionally, many state-of-the-art MI models are not culture-aware, and the ethics of their adoption indexes are questionable. The lack thereof is a major drawback in many applications in Africa. This paper summarizes recent developments in machine intelligence in Africa from a multi-layer multiscale and culture-aware ethics perspective, showcasing MI use cases in 54 African countries through 400 articles on MI research, industry, government actions, as well as uses in art, music, the informal economy, and small businesses in Africa. The survey also opens discussions on the reliability of MI rankings and indexes in the African continent as well as algorithmic definitions of unclear terms used in MI.
Air Quality Forecasting Using Machine Learning: A Global perspective with Relevance to Low-Resource Settings
Christian, Mulomba Mukendi, Choi, Hyebong
Air pollution stands as the fourth leading cause of death globally. While extensive research has been conducted in this domain, most approaches rely on large datasets when it comes to prediction. This limits their applicability in low-resource settings though more vulnerable. This study addresses this gap by proposing a novel machine learning approach for accurate air quality prediction using two months of air quality data. By leveraging the World Weather Repository, the meteorological, air pollutant, and Air Quality Index features from 197 capital cities were considered to predict air quality for the next day. The evaluation of several machine learning models demonstrates the effectiveness of the Random Forest algorithm in generating reliable predictions, particularly when applied to classification rather than regression, approach which enhances the model's generalizability by 42%, achieving a cross-validation score of 0.38 for regression and 0.89 for classification. To instill confidence in the predictions, interpretable machine learning was considered. Finally, a cost estimation comparing the implementation of this solution in high-resource and low-resource settings is presented including a tentative of technology licensing business model. This research highlights the potential for resource-limited countries to independently predict air quality while awaiting larger datasets to further refine their predictions.
Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages
Sikasote, Claytone, Siaminwe, Kalinda, Mwape, Stanly, Zulu, Bangiwe, Phiri, Mofya, Phiri, Martin, Zulu, David, Nyirenda, Mayumbo, Anastasopoulos, Antonios
This work introduces Zambezi Voice, an open-source multilingual speech resource for Zambian languages. It contains two collections of datasets: unlabelled audio recordings of radio news and talk shows programs (160 hours) and labelled data (over 80 hours) consisting of read speech recorded from text sourced from publicly available literature books. The dataset is created for speech recognition but can be extended to multilingual speech processing research for both supervised and unsupervised learning approaches. To our knowledge, this is the first multilingual speech dataset created for Zambian languages. We exploit pretraining and cross-lingual transfer learning by finetuning the Wav2Vec2.0
At least 9 killed in eastern Congo's latest extremist rebel attack
Fox News Flash top headlines are here. Check out what's clicking on Foxnews.com. Extremist rebels in eastern Congo killed at least nine people with knives and guns, a civil society organization said Friday. The attack happened Thursday evening on the Kyondo-Kyavinyonge road in North Kivu province, said Meleki Mulala the coordinator for the Congolese civil society group for the Ruwenzori sector. Civilians were taken from their homes before they were killed, and many homes were looted, he said.
A Comparative Analysis of CNN-Based Pretrained Models for the Detection and Prediction of Monkeypox
Saha, Sourav, Chakraborty, Trina, Sulaiman, Rejwan Bin, Paul, Tithi
Monkeypox is a rare disease that raised concern among medical specialists following the convi-19 pandemic. It's concerning since monkeypox is difficult to diagnose early on because of symptoms that are similar to chickenpox and measles. Furthermore, because this is a rare condition, there is a knowledge gap among healthcare professionals. As a result, there is an urgent need for a novel technique to combat and anticipate the disease in the early phases of individual virus infection. Multiple CNN-based pre-trained models, including VGG-16, VGG-19, Restnet50, Inception-V3, Densnet, Xception, MobileNetV2, Alexnet, Lenet, and majority Voting, were employed in classification in this study. For this study, multiple data sets were combined, such as monkeypox vs chickenpox, monkeypox versus measles, monkeypox versus normal, and monkeypox versus all diseases. Majority voting performed 97% in monkeypox vs chickenpox, Xception achieved 79% in monkeypox against measles, MobileNetV2 scored 96% in monkeypox vs normal, and Lenet performed 80% in monkeypox versus all.